Assembly and annotation of a BAC clone from a newly sequenced genome of Anopheles funestus
نویسندگان
چکیده
Whole genome sequencing of disease vectors such as Aedes aegypti, Anopheles gambiae, and recently Anopheles funestus are enabling scientists to identify and investigate regions involved in the genetic plasticity of these vectors in order to formulate new disease control strategies. In the current study, sequence data from a single Anopheles funestus BAC clone is examined in order to predict and annotate genes in this region. After cloning, cleaning contaminating sequences, and assembling a consensus BAC sequence, several prediction algorithms provided the data then used to refine the predictions by selecting only overlapping predicted genes. These genes and the assembled BAC sequence were compared to the Drosophila melanogaster and Anopheles gambiae annotated genomes found in the Ensembl database. Two genes of the Anopheles funestus BACwere found with high similarity in Drosophila melanogaster. Furthermore, these genes showed synteny across Anopheles funestus and Drosophila melanogaster by appearing in clusters in both organisms. A high degree of conservation of one of these genes was discovered when compared with Anopheles gambiae; this gene presented a 100% similarity with respect to the BAC sequence. Putative genes belonging to the KQT Potassium Voltage Gated Channel Subfamily were found to belong to a syntenic cluster conserved both in Drosophila melanogaster and in Anopheles gambiae.
منابع مشابه
De Novo Transcriptome Sequencing in Anopheles funestus Using Illumina RNA-Seq Technology
BACKGROUND Anopheles funestus is one of the primary vectors of human malaria, which causes a million deaths each year in sub-Saharan Africa. Few scientific resources are available to facilitate studies of this mosquito species and relatively little is known about its basic biology and evolution, making development and implementation of novel disease control efforts more difficult. The An. funes...
متن کاملSingle haplotype assembly of the human genome from a hydatidiform mole.
A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although inc...
متن کاملLong Read Sequencing Technology to Solve Complex Genomic Regions Assembly in Plants
During the last decade, we have observed remarkable advances in sequencing technology and bioinformatics analysis. The turning point came when the pyrosequencing technologies became available for the scientific community. Following Sanger’s method, pyrosequencing has provided a massive increase in sequencing throughput combined with a huge decrease in the cost per sequenced base. Thus, it becam...
متن کاملComparative genomic analysis in the region of a major Plasmodium-refractoriness locus of Anopheles gambiae.
We have sequenced six overlapping clones from a library of bacterial artificial chromosome (BAC) clones derived from a laboratory strain of the mosquito, Anopheles gambiae, the major vector of human malaria in Africa. The resulting uninterrupted 528-kb sequence is from the 8C region of the mosquito 2R chromosome, at or very near the major refractoriness locus associated with melanotic encapsula...
متن کاملUpgrading the DNA Sequence of the Rat Genome
The Brown Norway or laboratory rat (Rattus norvegicus) genome was sequenced in a project jointly supported by the NHGRI and the NHLBI (12). This was the third mammalian project undertaken by the NIH but the first that would only be taken to the draft stage. The project was a complex collaboration led by the BCM-HGSC (BAC skims, wgs, assembly, annotation, overall coordination) and including Cele...
متن کامل